Dataset statistics
| Number of variables | 28 |
|---|---|
| Number of observations | 31076 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 672 |
| Duplicate rows (%) | 2.2% |
| Total size in memory | 5.4 MiB |
| Average record size in memory | 182.0 B |
Variable types
| Numeric | 11 |
|---|---|
| Categorical | 11 |
| Boolean | 6 |
Reason has constant value "CR" | Constant |
Is_year_start has constant value "False" | Constant |
| Dataset has 672 (2.2%) duplicate rows | Duplicates |
Id has a high cardinality: 3827 distinct values | High cardinality |
Applied is highly correlated with Received | High correlation |
Received is highly correlated with Applied | High correlation |
logapplied is highly correlated with logreceived | High correlation |
logreceived is highly correlated with logapplied | High correlation |
Year is highly correlated with Elapsed | High correlation |
Month is highly correlated with Week and 1 other fields | High correlation |
Week is highly correlated with Month and 1 other fields | High correlation |
Dayofyear is highly correlated with Month and 1 other fields | High correlation |
Elapsed is highly correlated with Year | High correlation |
True_False is highly correlated with Reason and 1 other fields | High correlation |
Reason is highly correlated with True_False and 14 other fields | High correlation |
Area is highly correlated with Reason and 1 other fields | High correlation |
Is_month_end is highly correlated with Reason and 1 other fields | High correlation |
Payment_Method is highly correlated with Reason and 2 other fields | High correlation |
Is_month_start is highly correlated with Reason and 1 other fields | High correlation |
Gender is highly correlated with Reason and 1 other fields | High correlation |
Is_year_start is highly correlated with True_False and 14 other fields | High correlation |
Age is highly correlated with Reason and 2 other fields | High correlation |
AgeGroup is highly correlated with Reason and 2 other fields | High correlation |
Location is highly correlated with Reason and 1 other fields | High correlation |
Payment_Type is highly correlated with Reason and 2 other fields | High correlation |
Year is highly correlated with Reason and 1 other fields | High correlation |
Is_year_end is highly correlated with Reason and 1 other fields | High correlation |
Is_quarter_end is highly correlated with Reason and 1 other fields | High correlation |
Is_quarter_start is highly correlated with Reason and 1 other fields | High correlation |
Ratio is highly skewed (γ1 = -90.54043879) | Skewed |
Dayofweek has 5491 (17.7%) zeros | Zeros |
Reproduction
| Analysis started | 2021-04-26 20:46:14.102419 |
|---|---|
| Analysis finished | 2021-04-26 20:47:12.496143 |
| Duration | 58.39 seconds |
| Software version | pandas-profiling v2.11.0 |
| Download configuration | config.yaml |
| Distinct | 1901 |
|---|---|
| Distinct (%) | 6.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 771.3485648 |
|---|---|
| Minimum | 3 |
| Maximum | 13942 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 242.9 KiB |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 150 |
| Q1 | 330 |
| median | 560 |
| Q3 | 1050 |
| 95-th percentile | 2000 |
| Maximum | 13942 |
| Range | 13939 |
| Interquartile range (IQR) | 720 |
Descriptive statistics
| Standard deviation | 607.652693 |
|---|---|
| Coefficient of variation (CV) | 0.7877796378 |
| Kurtosis | 9.271635327 |
| Mean | 771.3485648 |
| Median Absolute Deviation (MAD) | 300 |
| Skewness | 1.710803489 |
| Sum | 23970428 |
| Variance | 369241.7953 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 400 | 579 | 1.9% |
| 300 | 563 | 1.8% |
| 500 | 554 | 1.8% |
| 600 | 548 | 1.8% |
| 450 | 404 | 1.3% |
| 200 | 385 | 1.2% |
| 700 | 384 | 1.2% |
| 350 | 373 | 1.2% |
| 1000 | 372 | 1.2% |
| 1200 | 346 | 1.1% |
| Other values (1891) | 26568 |
| Value | Count | Frequency (%) |
| 3 | 1 | |
| 4 | 1 | |
| 6 | 1 | |
| 10 | 1 | |
| 12 | 1 |
| Value | Count | Frequency (%) |
| 13942 | 1 | |
| 7031 | 1 | |
| 6061 | 1 | |
| 5186 | 1 | |
| 4945 | 1 |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 242.9 KiB |
| F | |
|---|---|
| M | |
| GD | 4 |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.000128717 |
| Min length | 1 |
Characters and Unicode
| Total characters | 31080 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | F |
|---|---|
| 2nd row | M |
| 3rd row | M |
| 4th row | M |
| 5th row | F |
| Value | Count | Frequency (%) |
| F | 19794 | |
| M | 11278 | |
| GD | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| f | 19794 | |
| m | 11278 | |
| gd | 4 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| F | 19794 | |
| M | 11278 | |
| G | 4 | < 0.1% |
| D | 4 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 31080 |
Most frequent character per category
| Value | Count | Frequency (%) |
| F | 19794 | |
| M | 11278 | |
| G | 4 | < 0.1% |
| D | 4 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 31080 |
Most frequent character per script
| Value | Count | Frequency (%) |
| F | 19794 | |
| M | 11278 | |
| G | 4 | < 0.1% |
| D | 4 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 31080 |
Most frequent character per block
| Value | Count | Frequency (%) |
| F | 19794 | |
| M | 11278 | |
| G | 4 | < 0.1% |
| D | 4 | < 0.1% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 242.9 KiB |
| AV | |
|---|---|
| RP |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 62152 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | RP |
|---|---|
| 2nd row | AV |
| 3rd row | AV |
| 4th row | RP |
| 5th row | AV |
| Value | Count | Frequency (%) |
| AV | 26795 | |
| RP | 4281 | 13.8% |
| Value | Count | Frequency (%) |
| av | 26795 | |
| rp | 4281 | 13.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 26795 | |
| V | 26795 | |
| R | 4281 | 6.9% |
| P | 4281 | 6.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 62152 |
Most frequent character per category
| Value | Count | Frequency (%) |
| A | 26795 | |
| V | 26795 | |
| R | 4281 | 6.9% |
| P | 4281 | 6.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 62152 |
Most frequent character per script
| Value | Count | Frequency (%) |
| A | 26795 | |
| V | 26795 | |
| R | 4281 | 6.9% |
| P | 4281 | 6.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 62152 |
Most frequent character per block
| Value | Count | Frequency (%) |
| A | 26795 | |
| V | 26795 | |
| R | 4281 | 6.9% |
| P | 4281 | 6.9% |
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 242.9 KiB |
| M | |
|---|---|
| NE | |
| O | |
| PP | |
| U | 1112 |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.405908096 |
| Min length | 1 |
Characters and Unicode
| Total characters | 43690 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NE |
|---|---|
| 2nd row | PP |
| 3rd row | M |
| 4th row | NE |
| 5th row | NE |
| Value | Count | Frequency (%) |
| M | 13959 | |
| NE | 9240 | |
| O | 3391 | 10.9% |
| PP | 3374 | 10.9% |
| U | 1112 | 3.6% |
| Value | Count | Frequency (%) |
| m | 13959 | |
| ne | 9240 | |
| o | 3391 | 10.9% |
| pp | 3374 | 10.9% |
| u | 1112 | 3.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 13959 | |
| N | 9240 | |
| E | 9240 | |
| P | 6748 | |
| O | 3391 | 7.8% |
| U | 1112 | 2.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 43690 |
Most frequent character per category
| Value | Count | Frequency (%) |
| M | 13959 | |
| N | 9240 | |
| E | 9240 | |
| P | 6748 | |
| O | 3391 | 7.8% |
| U | 1112 | 2.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 43690 |
Most frequent character per script
| Value | Count | Frequency (%) |
| M | 13959 | |
| N | 9240 | |
| E | 9240 | |
| P | 6748 | |
| O | 3391 | 7.8% |
| U | 1112 | 2.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 43690 |
Most frequent character per block
| Value | Count | Frequency (%) |
| M | 13959 | |
| N | 9240 | |
| E | 9240 | |
| P | 6748 | |
| O | 3391 | 7.8% |
| U | 1112 | 2.5% |
| Distinct | 4633 |
|---|---|
| Distinct (%) | 14.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 771.3352253 |
|---|---|
| Minimum | 2.6 |
| Maximum | 13941.5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 242.9 KiB |
Quantile statistics
| Minimum | 2.6 |
|---|---|
| 5-th percentile | 150 |
| Q1 | 330 |
| median | 560 |
| Q3 | 1050 |
| 95-th percentile | 2000 |
| Maximum | 13941.5 |
| Range | 13938.9 |
| Interquartile range (IQR) | 720 |
Descriptive statistics
| Standard deviation | 607.6522426 |
|---|---|
| Coefficient of variation (CV) | 0.7877926779 |
| Kurtosis | 9.270519592 |
| Mean | 771.3352253 |
| Median Absolute Deviation (MAD) | 300 |
| Skewness | 1.710750861 |
| Sum | 23970013.46 |
| Variance | 369241.248 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 400 | 576 | 1.9% |
| 300 | 559 | 1.8% |
| 600 | 544 | 1.8% |
| 500 | 543 | 1.7% |
| 450 | 399 | 1.3% |
| 700 | 383 | 1.2% |
| 200 | 380 | 1.2% |
| 1000 | 370 | 1.2% |
| 350 | 369 | 1.2% |
| 1200 | 345 | 1.1% |
| Other values (4623) | 26608 |
| Value | Count | Frequency (%) |
| 2.6 | 1 | |
| 4.12 | 1 | |
| 6.12 | 1 | |
| 10 | 1 | |
| 12 | 1 |
| Value | Count | Frequency (%) |
| 13941.5 | 1 | |
| 7031.36 | 1 | |
| 6060.5 | 1 | |
| 5186 | 1 | |
| 4944.77 | 1 |
| Distinct | 3827 |
|---|---|
| Distinct (%) | 12.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 242.9 KiB |
| GHI000137495 | 722 |
|---|---|
| GHI000135471 | 657 |
| GHI000084252 | 620 |
| GHI001304576 | 465 |
| GHI000151115 | 459 |
| Other values (3822) |
Length
| Max length | 12 |
|---|---|
| Median length | 12 |
| Mean length | 12 |
| Min length | 12 |
Characters and Unicode
| Total characters | 372912 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1285 ? |
|---|---|
| Unique (%) | 4.1% |
Sample
| 1st row | GHI000076584 |
|---|---|
| 2nd row | GHI000135471 |
| 3rd row | GHI000159249 |
| 4th row | GHI000844291 |
| 5th row | GHI000441861 |
| Value | Count | Frequency (%) |
| GHI000137495 | 722 | 2.3% |
| GHI000135471 | 657 | 2.1% |
| GHI000084252 | 620 | 2.0% |
| GHI001304576 | 465 | 1.5% |
| GHI000151115 | 459 | 1.5% |
| GHI000140573 | 457 | 1.5% |
| GHI000143720 | 449 | 1.4% |
| GHI000877574 | 424 | 1.4% |
| GHI000275983 | 345 | 1.1% |
| GHI000076584 | 303 | 1.0% |
| Other values (3817) | 26175 |
| Value | Count | Frequency (%) |
| ghi000137495 | 722 | 2.3% |
| ghi000135471 | 657 | 2.1% |
| ghi000084252 | 620 | 2.0% |
| ghi001304576 | 465 | 1.5% |
| ghi000151115 | 459 | 1.5% |
| ghi000140573 | 457 | 1.5% |
| ghi000143720 | 449 | 1.4% |
| ghi000877574 | 424 | 1.4% |
| ghi000275983 | 345 | 1.1% |
| ghi000076584 | 303 | 1.0% |
| Other values (3817) | 26175 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 104660 | |
| 1 | 32496 | 8.7% |
| G | 31076 | 8.3% |
| H | 31076 | 8.3% |
| I | 31076 | 8.3% |
| 5 | 20657 | 5.5% |
| 4 | 19958 | 5.4% |
| 2 | 18532 | 5.0% |
| 3 | 18498 | 5.0% |
| 7 | 18261 | 4.9% |
| Other values (3) | 46622 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 279684 | |
| Uppercase Letter | 93228 | 25.0% |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 104660 | |
| 1 | 32496 | 11.6% |
| 5 | 20657 | 7.4% |
| 4 | 19958 | 7.1% |
| 2 | 18532 | 6.6% |
| 3 | 18498 | 6.6% |
| 7 | 18261 | 6.5% |
| 8 | 17264 | 6.2% |
| 9 | 15291 | 5.5% |
| 6 | 14067 | 5.0% |
| Value | Count | Frequency (%) |
| G | 31076 | |
| H | 31076 | |
| I | 31076 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 279684 | |
| Latin | 93228 | 25.0% |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 104660 | |
| 1 | 32496 | 11.6% |
| 5 | 20657 | 7.4% |
| 4 | 19958 | 7.1% |
| 2 | 18532 | 6.6% |
| 3 | 18498 | 6.6% |
| 7 | 18261 | 6.5% |
| 8 | 17264 | 6.2% |
| 9 | 15291 | 5.5% |
| 6 | 14067 | 5.0% |
| Value | Count | Frequency (%) |
| G | 31076 | |
| H | 31076 | |
| I | 31076 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 372912 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 104660 | |
| 1 | 32496 | 8.7% |
| G | 31076 | 8.3% |
| H | 31076 | 8.3% |
| I | 31076 | 8.3% |
| 5 | 20657 | 5.5% |
| 4 | 19958 | 5.4% |
| 2 | 18532 | 5.0% |
| 3 | 18498 | 5.0% |
| 7 | 18261 | 4.9% |
| Other values (3) | 46622 |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 242.9 KiB |
| CR |
|---|
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 62152 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | CR |
|---|---|
| 2nd row | CR |
| 3rd row | CR |
| 4th row | CR |
| 5th row | CR |
| Value | Count | Frequency (%) |
| CR | 31076 |
| Value | Count | Frequency (%) |
| cr | 31076 |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 31076 | |
| R | 31076 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 62152 |
Most frequent character per category
| Value | Count | Frequency (%) |
| C | 31076 | |
| R | 31076 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 62152 |
Most frequent character per script
| Value | Count | Frequency (%) |
| C | 31076 | |
| R | 31076 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 62152 |
Most frequent character per block
| Value | Count | Frequency (%) |
| C | 31076 | |
| R | 31076 |
| Distinct | 13 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 242.9 KiB |
| 25-29 | |
|---|---|
| 30-34 | |
| 20-24 | |
| 35-39 | |
| 40-44 | |
| Other values (8) |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.873664564 |
| Min length | 2 |
Characters and Unicode
| Total characters | 151454 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 3 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 35-39 |
|---|---|
| 2nd row | 30-34 |
| 3rd row | 50-54 |
| 4th row | 35-39 |
| 5th row | 20-24 |
| Value | Count | Frequency (%) |
| 25-29 | 5227 | |
| 30-34 | 4362 | |
| 20-24 | 4289 | |
| 35-39 | 3607 | |
| 40-44 | 2882 | |
| 45-49 | 2676 | |
| 50-54 | 2160 | |
| 65+ | 1816 | 5.8% |
| 55-59 | 1772 | 5.7% |
| 60-64 | 1361 | 4.4% |
| Other values (3) | 924 | 3.0% |
| Value | Count | Frequency (%) |
| 25-29 | 5227 | |
| 30-34 | 4362 | |
| 20-24 | 4289 | |
| 35-39 | 3607 | |
| 40-44 | 2882 | |
| 45-49 | 2676 | |
| 50-54 | 2160 | |
| 65 | 1816 | 5.8% |
| 55-59 | 1772 | 5.7% |
| 60-64 | 1361 | 4.4% |
| Other values (3) | 924 | 3.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 29162 | |
| 4 | 26170 | |
| 5 | 22962 | |
| 2 | 19032 | |
| 3 | 15938 | |
| 0 | 15054 | |
| 9 | 14108 | |
| 6 | 4556 | 3.0% |
| + | 1816 | 1.2% |
| 1 | 1750 | 1.2% |
| Other values (2) | 906 | 0.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 120476 | |
| Dash Punctuation | 29162 | 19.3% |
| Math Symbol | 1816 | 1.2% |
Most frequent character per category
| Value | Count | Frequency (%) |
| 4 | 26170 | |
| 5 | 22962 | |
| 2 | 19032 | |
| 3 | 15938 | |
| 0 | 15054 | |
| 9 | 14108 | |
| 6 | 4556 | 3.8% |
| 1 | 1750 | 1.5% |
| 8 | 826 | 0.7% |
| 7 | 80 | 0.1% |
| Value | Count | Frequency (%) |
| - | 29162 |
| Value | Count | Frequency (%) |
| + | 1816 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 151454 |
Most frequent character per script
| Value | Count | Frequency (%) |
| - | 29162 | |
| 4 | 26170 | |
| 5 | 22962 | |
| 2 | 19032 | |
| 3 | 15938 | |
| 0 | 15054 | |
| 9 | 14108 | |
| 6 | 4556 | 3.0% |
| + | 1816 | 1.2% |
| 1 | 1750 | 1.2% |
| Other values (2) | 906 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 151454 |
Most frequent character per block
| Value | Count | Frequency (%) |
| - | 29162 | |
| 4 | 26170 | |
| 5 | 22962 | |
| 2 | 19032 | |
| 3 | 15938 | |
| 0 | 15054 | |
| 9 | 14108 | |
| 6 | 4556 | 3.0% |
| + | 1816 | 1.2% |
| 1 | 1750 | 1.2% |
| Other values (2) | 906 | 0.6% |
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 242.9 KiB |
| AM | |
|---|---|
| O | |
| C | |
| W | |
| BP | |
| Other values (6) |
Length
| Max length | 3 |
|---|---|
| Median length | 2 |
| Mean length | 1.614364783 |
| Min length | 1 |
Characters and Unicode
| Total characters | 50168 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | AM |
|---|---|
| 2nd row | O |
| 3rd row | EC |
| 4th row | N |
| 5th row | T |
| Value | Count | Frequency (%) |
| AM | 12773 | |
| O | 4202 | 13.5% |
| C | 2898 | 9.3% |
| W | 2458 | 7.9% |
| BP | 2196 | 7.1% |
| NL | 1323 | 4.3% |
| S | 1266 | 4.1% |
| T | 1255 | 4.0% |
| EC | 1086 | 3.5% |
| Wlg | 857 | 2.8% |
| Value | Count | Frequency (%) |
| am | 12773 | |
| o | 4202 | 13.5% |
| c | 2898 | 9.3% |
| w | 2458 | 7.9% |
| bp | 2196 | 7.1% |
| nl | 1323 | 4.3% |
| s | 1266 | 4.1% |
| t | 1255 | 4.0% |
| ec | 1086 | 3.5% |
| wlg | 857 | 2.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 12773 | |
| M | 12773 | |
| O | 4202 | 8.4% |
| C | 3984 | 7.9% |
| W | 3315 | 6.6% |
| B | 2196 | 4.4% |
| P | 2196 | 4.4% |
| N | 2085 | 4.2% |
| L | 1323 | 2.6% |
| S | 1266 | 2.5% |
| Other values (4) | 4055 | 8.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 48454 | |
| Lowercase Letter | 1714 | 3.4% |
Most frequent character per category
| Value | Count | Frequency (%) |
| A | 12773 | |
| M | 12773 | |
| O | 4202 | 8.7% |
| C | 3984 | 8.2% |
| W | 3315 | 6.8% |
| B | 2196 | 4.5% |
| P | 2196 | 4.5% |
| N | 2085 | 4.3% |
| L | 1323 | 2.7% |
| S | 1266 | 2.6% |
| Other values (2) | 2341 | 4.8% |
| Value | Count | Frequency (%) |
| l | 857 | |
| g | 857 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 50168 |
Most frequent character per script
| Value | Count | Frequency (%) |
| A | 12773 | |
| M | 12773 | |
| O | 4202 | 8.4% |
| C | 3984 | 7.9% |
| W | 3315 | 6.6% |
| B | 2196 | 4.4% |
| P | 2196 | 4.4% |
| N | 2085 | 4.2% |
| L | 1323 | 2.6% |
| S | 1266 | 2.5% |
| Other values (4) | 4055 | 8.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 50168 |
Most frequent character per block
| Value | Count | Frequency (%) |
| A | 12773 | |
| M | 12773 | |
| O | 4202 | 8.4% |
| C | 3984 | 7.9% |
| W | 3315 | 6.6% |
| B | 2196 | 4.4% |
| P | 2196 | 4.4% |
| N | 2085 | 4.2% |
| L | 1323 | 2.6% |
| S | 1266 | 2.5% |
| Other values (4) | 4055 | 8.1% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 242.9 KiB |
| 0 | |
|---|---|
| 1 | 355 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 31076 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
| Value | Count | Frequency (%) |
| 0 | 30721 | |
| 1 | 355 | 1.1% |
| Value | Count | Frequency (%) |
| 0 | 30721 | |
| 1 | 355 | 1.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 30721 | |
| 1 | 355 | 1.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 31076 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 30721 | |
| 1 | 355 | 1.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 31076 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 30721 | |
| 1 | 355 | 1.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 31076 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 30721 | |
| 1 | 355 | 1.1% |
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 242.9 KiB |
| MidAge | |
|---|---|
| Adult | |
| Old | |
| Teenage | 98 |
Length
| Max length | 7 |
|---|---|
| Median length | 5 |
| Mean length | 4.984071309 |
| Min length | 3 |
Characters and Unicode
| Total characters | 154885 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | MidAge |
|---|---|
| 2nd row | MidAge |
| 3rd row | Old |
| 4th row | MidAge |
| 5th row | Adult |
| Value | Count | Frequency (%) |
| MidAge | 13527 | |
| Adult | 10342 | |
| Old | 7109 | |
| Teenage | 98 | 0.3% |
| Value | Count | Frequency (%) |
| midage | 13527 | |
| adult | 10342 | |
| old | 7109 | |
| teenage | 98 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| d | 30978 | |
| A | 23869 | |
| l | 17451 | |
| e | 13821 | |
| g | 13625 | |
| M | 13527 | |
| i | 13527 | |
| u | 10342 | 6.7% |
| t | 10342 | 6.7% |
| O | 7109 | 4.6% |
| Other values (3) | 294 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 110282 | |
| Uppercase Letter | 44603 |
Most frequent character per category
| Value | Count | Frequency (%) |
| d | 30978 | |
| l | 17451 | |
| e | 13821 | |
| g | 13625 | |
| i | 13527 | |
| u | 10342 | 9.4% |
| t | 10342 | 9.4% |
| n | 98 | 0.1% |
| a | 98 | 0.1% |
| Value | Count | Frequency (%) |
| A | 23869 | |
| M | 13527 | |
| O | 7109 | 15.9% |
| T | 98 | 0.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 154885 |
Most frequent character per script
| Value | Count | Frequency (%) |
| d | 30978 | |
| A | 23869 | |
| l | 17451 | |
| e | 13821 | |
| g | 13625 | |
| M | 13527 | |
| i | 13527 | |
| u | 10342 | 6.7% |
| t | 10342 | 6.7% |
| O | 7109 | 4.6% |
| Other values (3) | 294 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 154885 |
Most frequent character per block
| Value | Count | Frequency (%) |
| d | 30978 | |
| A | 23869 | |
| l | 17451 | |
| e | 13821 | |
| g | 13625 | |
| M | 13527 | |
| i | 13527 | |
| u | 10342 | 6.7% |
| t | 10342 | 6.7% |
| O | 7109 | 4.6% |
| Other values (3) | 294 | 0.2% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 242.9 KiB |
| AV | |
|---|---|
| RPU |
Length
| Max length | 3 |
|---|---|
| Median length | 2 |
| Mean length | 2.137759042 |
| Min length | 2 |
Characters and Unicode
| Total characters | 66433 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | RPU |
|---|---|
| 2nd row | AV |
| 3rd row | AV |
| 4th row | RPU |
| 5th row | AV |
| Value | Count | Frequency (%) |
| AV | 26795 | |
| RPU | 4281 | 13.8% |
| Value | Count | Frequency (%) |
| av | 26795 | |
| rpu | 4281 | 13.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 26795 | |
| V | 26795 | |
| R | 4281 | 6.4% |
| P | 4281 | 6.4% |
| U | 4281 | 6.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 66433 |
Most frequent character per category
| Value | Count | Frequency (%) |
| A | 26795 | |
| V | 26795 | |
| R | 4281 | 6.4% |
| P | 4281 | 6.4% |
| U | 4281 | 6.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 66433 |
Most frequent character per script
| Value | Count | Frequency (%) |
| A | 26795 | |
| V | 26795 | |
| R | 4281 | 6.4% |
| P | 4281 | 6.4% |
| U | 4281 | 6.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 66433 |
Most frequent character per block
| Value | Count | Frequency (%) |
| A | 26795 | |
| V | 26795 | |
| R | 4281 | 6.4% |
| P | 4281 | 6.4% |
| U | 4281 | 6.4% |
| Distinct | 1901 |
|---|---|
| Distinct (%) | 6.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.345708332 |
|---|---|
| Minimum | 1.098612289 |
| Maximum | 9.542661146 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 242.9 KiB |
Quantile statistics
| Minimum | 1.098612289 |
|---|---|
| 5-th percentile | 5.010635294 |
| Q1 | 5.799092654 |
| median | 6.327936784 |
| Q3 | 6.956545443 |
| 95-th percentile | 7.60090246 |
| Maximum | 9.542661146 |
| Range | 8.444048857 |
| Interquartile range (IQR) | 1.157452789 |
Descriptive statistics
| Standard deviation | 0.8117956407 |
|---|---|
| Coefficient of variation (CV) | 0.1279282939 |
| Kurtosis | -0.03917302714 |
| Mean | 6.345708332 |
| Median Absolute Deviation (MAD) | 0.5798184953 |
| Skewness | -0.2773979633 |
| Sum | 197199.2321 |
| Variance | 0.6590121622 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 5.991464547 | 579 | 1.9% |
| 5.703782475 | 563 | 1.8% |
| 6.214608098 | 554 | 1.8% |
| 6.396929655 | 548 | 1.8% |
| 6.109247583 | 404 | 1.3% |
| 5.298317367 | 385 | 1.2% |
| 6.551080335 | 384 | 1.2% |
| 5.857933154 | 373 | 1.2% |
| 6.907755279 | 372 | 1.2% |
| 7.090076836 | 346 | 1.1% |
| Other values (1891) | 26568 |
| Value | Count | Frequency (%) |
| 1.098612289 | 1 | |
| 1.386294361 | 1 | |
| 1.791759469 | 1 | |
| 2.302585093 | 1 | |
| 2.48490665 | 1 |
| Value | Count | Frequency (%) |
| 9.542661146 | 1 | |
| 8.858084222 | 1 | |
| 8.709630082 | 1 | |
| 8.553717966 | 1 | |
| 8.506132244 | 1 |
| Distinct | 4633 |
|---|---|
| Distinct (%) | 14.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.345670011 |
|---|---|
| Minimum | 0.955511445 |
| Maximum | 9.542625283 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 242.9 KiB |
Quantile statistics
| Minimum | 0.955511445 |
|---|---|
| 5-th percentile | 5.010635294 |
| Q1 | 5.799092654 |
| median | 6.327936784 |
| Q3 | 6.956545443 |
| 95-th percentile | 7.60090246 |
| Maximum | 9.542625283 |
| Range | 8.587113838 |
| Interquartile range (IQR) | 1.157452789 |
Descriptive statistics
| Standard deviation | 0.8118505925 |
|---|---|
| Coefficient of variation (CV) | 0.1279377262 |
| Kurtosis | -0.03452150853 |
| Mean | 6.345670011 |
| Median Absolute Deviation (MAD) | 0.5798184953 |
| Skewness | -0.2779963609 |
| Sum | 197198.0413 |
| Variance | 0.6591013846 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 5.991464547 | 576 | 1.9% |
| 5.703782475 | 559 | 1.8% |
| 6.396929655 | 544 | 1.8% |
| 6.214608098 | 543 | 1.7% |
| 6.109247583 | 399 | 1.3% |
| 6.551080335 | 383 | 1.2% |
| 5.298317367 | 380 | 1.2% |
| 6.907755279 | 370 | 1.2% |
| 5.857933154 | 369 | 1.2% |
| 7.090076836 | 345 | 1.1% |
| Other values (4623) | 26608 |
| Value | Count | Frequency (%) |
| 0.955511445 | 1 | |
| 1.415853163 | 1 | |
| 1.811562097 | 1 | |
| 2.302585093 | 1 | |
| 2.48490665 | 1 |
| Value | Count | Frequency (%) |
| 9.542625283 | 1 | |
| 8.858135423 | 1 | |
| 8.709547584 | 1 | |
| 8.553717966 | 1 | |
| 8.506085731 | 1 |
| Distinct | 3273 |
|---|---|
| Distinct (%) | 10.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.9999621512 |
|---|---|
| Minimum | 0.8666666667 |
| Maximum | 1.03 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 242.9 KiB |
Quantile statistics
| Minimum | 0.8666666667 |
|---|---|
| 5-th percentile | 0.9996148893 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 1.000089955 |
| Maximum | 1.03 |
| Range | 0.1633333333 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.0009418376605 |
|---|---|
| Coefficient of variation (CV) | 0.0009418733093 |
| Kurtosis | 12987.70877 |
| Mean | 0.9999621512 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -90.54043879 |
| Sum | 31074.82381 |
| Variance | 8.870581787 × 107 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 26670 | |
| 0.9982638889 | 24 | 0.1% |
| 0.999034749 | 24 | 0.1% |
| 0.9987593052 | 21 | 0.1% |
| 0.9988262911 | 14 | < 0.1% |
| 0.9991134752 | 14 | < 0.1% |
| 0.9966666667 | 13 | < 0.1% |
| 1.000068695 | 13 | < 0.1% |
| 0.9999086758 | 12 | < 0.1% |
| 0.998940678 | 12 | < 0.1% |
| Other values (3263) | 4259 | 13.7% |
| Value | Count | Frequency (%) |
| 0.8666666667 | 1 | |
| 0.9782608696 | 1 | |
| 0.9817391304 | 1 | |
| 0.9841666667 | 1 | |
| 0.9851612903 | 1 |
| Value | Count | Frequency (%) |
| 1.03 | 1 | |
| 1.02 | 2 | |
| 1.017083333 | 1 | |
| 1.013714286 | 1 | |
| 1.013333333 | 1 |
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 242.9 KiB |
| 2019 | |
|---|---|
| 2018 | |
| 2020 | |
| 2017 | |
| 2016 | 579 |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 124304 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2019 |
|---|---|
| 2nd row | 2020 |
| 3rd row | 2017 |
| 4th row | 2018 |
| 5th row | 2020 |
| Value | Count | Frequency (%) |
| 2019 | 8073 | |
| 2018 | 7626 | |
| 2020 | 7535 | |
| 2017 | 7263 | |
| 2016 | 579 | 1.9% |
| Value | Count | Frequency (%) |
| 2019 | 8073 | |
| 2018 | 7626 | |
| 2020 | 7535 | |
| 2017 | 7263 | |
| 2016 | 579 | 1.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 38611 | |
| 0 | 38611 | |
| 1 | 23541 | |
| 9 | 8073 | 6.5% |
| 8 | 7626 | 6.1% |
| 7 | 7263 | 5.8% |
| 6 | 579 | 0.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 124304 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 2 | 38611 | |
| 0 | 38611 | |
| 1 | 23541 | |
| 9 | 8073 | 6.5% |
| 8 | 7626 | 6.1% |
| 7 | 7263 | 5.8% |
| 6 | 579 | 0.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 124304 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 2 | 38611 | |
| 0 | 38611 | |
| 1 | 23541 | |
| 9 | 8073 | 6.5% |
| 8 | 7626 | 6.1% |
| 7 | 7263 | 5.8% |
| 6 | 579 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 124304 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 2 | 38611 | |
| 0 | 38611 | |
| 1 | 23541 | |
| 9 | 8073 | 6.5% |
| 8 | 7626 | 6.1% |
| 7 | 7263 | 5.8% |
| 6 | 579 | 0.5% |
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.731078646 |
|---|---|
| Minimum | 1 |
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 242.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 4 |
| median | 7 |
| Q3 | 10 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 3.363299266 |
|---|---|
| Coefficient of variation (CV) | 0.4996672069 |
| Kurtosis | -1.15003414 |
| Mean | 6.731078646 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | -0.1005964716 |
| Sum | 209175 |
| Variance | 11.31178195 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 8 | 2927 | |
| 11 | 2907 | |
| 7 | 2885 | |
| 5 | 2775 | |
| 9 | 2771 | |
| 6 | 2724 | |
| 10 | 2669 | |
| 3 | 2508 | |
| 12 | 2467 | |
| 2 | 2423 | |
| Other values (2) | 4020 |
| Value | Count | Frequency (%) |
| 1 | 2105 | |
| 2 | 2423 | |
| 3 | 2508 | |
| 4 | 1915 | |
| 5 | 2775 |
| Value | Count | Frequency (%) |
| 12 | 2467 | |
| 11 | 2907 | |
| 10 | 2669 | |
| 9 | 2771 | |
| 8 | 2927 |
| Distinct | 52 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 27.61755052 |
|---|---|
| Minimum | 1 |
| Maximum | 52 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 242.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 15 |
| median | 28 |
| Q3 | 40 |
| 95-th percentile | 50 |
| Maximum | 52 |
| Range | 51 |
| Interquartile range (IQR) | 25 |
Descriptive statistics
| Standard deviation | 14.62574856 |
|---|---|
| Coefficient of variation (CV) | 0.5295816712 |
| Kurtosis | -1.168248914 |
| Mean | 27.61755052 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | -0.09976652774 |
| Sum | 858243 |
| Variance | 213.9125209 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 48 | 776 | 2.5% |
| 49 | 743 | 2.4% |
| 51 | 715 | 2.3% |
| 38 | 705 | 2.3% |
| 35 | 703 | 2.3% |
| 50 | 698 | 2.2% |
| 46 | 690 | 2.2% |
| 24 | 690 | 2.2% |
| 27 | 685 | 2.2% |
| 34 | 683 | 2.2% |
| Other values (42) | 23988 |
| Value | Count | Frequency (%) |
| 1 | 214 | 0.7% |
| 2 | 540 | |
| 3 | 542 | |
| 4 | 568 | |
| 5 | 500 |
| Value | Count | Frequency (%) |
| 52 | 216 | 0.7% |
| 51 | 715 | |
| 50 | 698 | |
| 49 | 743 | |
| 48 | 776 |
Day
Real number (ℝ≥0)
| Distinct | 31 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.84985198 |
|---|---|
| Minimum | 1 |
| Maximum | 31 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 242.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 9 |
| median | 16 |
| Q3 | 23 |
| 95-th percentile | 30 |
| Maximum | 31 |
| Range | 30 |
| Interquartile range (IQR) | 14 |
Descriptive statistics
| Standard deviation | 8.601931361 |
|---|---|
| Coefficient of variation (CV) | 0.5427136717 |
| Kurtosis | -1.133381584 |
| Mean | 15.84985198 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | 0.003603339528 |
| Sum | 492550 |
| Variance | 73.99322315 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 20 | 1248 | 4.0% |
| 13 | 1198 | 3.9% |
| 17 | 1170 | 3.8% |
| 21 | 1117 | 3.6% |
| 11 | 1106 | 3.6% |
| 12 | 1090 | 3.5% |
| 19 | 1082 | 3.5% |
| 5 | 1055 | 3.4% |
| 16 | 1042 | 3.4% |
| 18 | 1032 | 3.3% |
| Other values (21) | 19936 |
| Value | Count | Frequency (%) |
| 1 | 920 | |
| 2 | 866 | |
| 3 | 908 | |
| 4 | 912 | |
| 5 | 1055 |
| Value | Count | Frequency (%) |
| 31 | 573 | |
| 30 | 991 | |
| 29 | 889 | |
| 28 | 969 | |
| 27 | 1017 |
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.100752993 |
|---|---|
| Minimum | 0 |
| Maximum | 5 |
| Zeros | 5491 |
| Zeros (%) | 17.7% |
| Memory size | 242.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 4 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.419840363 |
|---|---|
| Coefficient of variation (CV) | 0.6758721127 |
| Kurtosis | -1.246559229 |
| Mean | 2.100752993 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -0.04535549859 |
| Sum | 65283 |
| Variance | 2.015946657 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 4 | 6578 | |
| 3 | 6501 | |
| 1 | 6204 | |
| 2 | 6082 | |
| 0 | 5491 | |
| 5 | 220 | 0.7% |
| Value | Count | Frequency (%) |
| 0 | 5491 | |
| 1 | 6204 | |
| 2 | 6082 | |
| 3 | 6501 | |
| 4 | 6578 |
| Value | Count | Frequency (%) |
| 5 | 220 | 0.7% |
| 4 | 6578 | |
| 3 | 6501 | |
| 2 | 6082 | |
| 1 | 6204 |
| Distinct | 359 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 189.5892972 |
|---|---|
| Minimum | 3 |
| Maximum | 365 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 242.9 KiB |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 24 |
| Q1 | 101 |
| median | 193 |
| Q3 | 276 |
| 95-th percentile | 345 |
| Maximum | 365 |
| Range | 362 |
| Interquartile range (IQR) | 175 |
Descriptive statistics
| Standard deviation | 102.4193942 |
|---|---|
| Coefficient of variation (CV) | 0.5402171729 |
| Kurtosis | -1.163665309 |
| Mean | 189.5892972 |
| Median Absolute Deviation (MAD) | 87 |
| Skewness | -0.09383894566 |
| Sum | 5891677 |
| Variance | 10489.7323 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 331 | 157 | 0.5% |
| 354 | 148 | 0.5% |
| 318 | 147 | 0.5% |
| 241 | 147 | 0.5% |
| 324 | 146 | 0.5% |
| 73 | 144 | 0.5% |
| 52 | 143 | 0.5% |
| 150 | 141 | 0.5% |
| 51 | 141 | 0.5% |
| 269 | 140 | 0.5% |
| Other values (349) | 29622 |
| Value | Count | Frequency (%) |
| 3 | 40 | |
| 4 | 56 | |
| 5 | 30 | |
| 6 | 51 | |
| 7 | 51 |
| Value | Count | Frequency (%) |
| 365 | 47 | |
| 364 | 40 | |
| 363 | 21 | |
| 362 | 34 | |
| 361 | 47 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 30.5 KiB |
| False | |
|---|---|
| True | 1028 |
| Value | Count | Frequency (%) |
| False | 30048 | |
| True | 1028 | 3.3% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 30.5 KiB |
| False | |
|---|---|
| True | 920 |
| Value | Count | Frequency (%) |
| False | 30156 | |
| True | 920 | 3.0% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 30.5 KiB |
| False | |
|---|---|
| True | 233 |
| Value | Count | Frequency (%) |
| False | 30843 | |
| True | 233 | 0.7% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 30.5 KiB |
| False | |
|---|---|
| True | 216 |
| Value | Count | Frequency (%) |
| False | 30860 | |
| True | 216 | 0.7% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 30.5 KiB |
| False | |
|---|---|
| True | 29 |
| Value | Count | Frequency (%) |
| False | 31047 | |
| True | 29 | 0.1% |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 30.5 KiB |
| False |
|---|
| Value | Count | Frequency (%) |
| False | 31076 |
| Distinct | 1063 |
|---|---|
| Distinct (%) | 3.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1545997227 |
|---|---|
| Minimum | 1480550400 |
| Maximum | 1606694400 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 242.9 KiB |
Quantile statistics
| Minimum | 1480550400 |
|---|---|
| 5-th percentile | 1487548800 |
| Q1 | 1513900800 |
| median | 1546905600 |
| Q3 | 1576627200 |
| 95-th percentile | 1601942400 |
| Maximum | 1606694400 |
| Range | 126144000 |
| Interquartile range (IQR) | 62726400 |
Descriptive statistics
| Standard deviation | 36700548.46 |
|---|---|
| Coefficient of variation (CV) | 0.02373907781 |
| Kurtosis | -1.180712162 |
| Mean | 1545997227 |
| Median Absolute Deviation (MAD) | 31190400 |
| Skewness | -0.05312932608 |
| Sum | 4.804340982 × 1013 |
| Variance | 1.346930257 × 1015 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 1597104000 | 62 | 0.2% |
| 1593475200 | 61 | 0.2% |
| 1604966400 | 58 | 0.2% |
| 1606435200 | 56 | 0.2% |
| 1487203200 | 56 | 0.2% |
| 1487289600 | 55 | 0.2% |
| 1576022400 | 54 | 0.2% |
| 1542931200 | 54 | 0.2% |
| 1513209600 | 54 | 0.2% |
| 1573171200 | 54 | 0.2% |
| Other values (1053) | 30512 |
| Value | Count | Frequency (%) |
| 1480550400 | 28 | |
| 1480636800 | 26 | |
| 1480896000 | 32 | |
| 1480982400 | 21 | |
| 1481068800 | 30 |
| Value | Count | Frequency (%) |
| 1606694400 | 47 | |
| 1606521600 | 6 | < 0.1% |
| 1606435200 | 56 | |
| 1606348800 | 45 | |
| 1606262400 | 37 |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| Applied | Gender | Payment_Method | Location | Received | Id | Reason | Age | Area | True_False | AgeGroup | Payment_Type | logapplied | logreceived | Ratio | Year | Month | Week | Day | Dayofweek | Dayofyear | Is_month_end | Is_month_start | Is_quarter_end | Is_quarter_start | Is_year_end | Is_year_start | Elapsed | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 510.0 | F | RP | NE | 510.00 | GHI000076584 | CR | 35-39 | AM | 0 | MidAge | RPU | 6.234411 | 6.234411 | 1.000000 | 2019 | 3 | 11 | 11 | 0 | 70 | False | False | False | False | False | False | 1552262400 |
| 1 | 252.0 | M | AV | PP | 252.00 | GHI000135471 | CR | 30-34 | O | 0 | MidAge | AV | 5.529429 | 5.529429 | 1.000000 | 2020 | 4 | 15 | 8 | 2 | 99 | False | False | False | False | False | False | 1586304000 |
| 2 | 140.0 | M | AV | M | 140.00 | GHI000159249 | CR | 50-54 | EC | 0 | Old | AV | 4.941642 | 4.941642 | 1.000000 | 2017 | 9 | 38 | 21 | 3 | 264 | False | False | False | False | False | False | 1505952000 |
| 3 | 380.0 | M | RP | NE | 380.00 | GHI000844291 | CR | 35-39 | N | 0 | MidAge | RPU | 5.940171 | 5.940171 | 1.000000 | 2018 | 7 | 27 | 2 | 0 | 183 | False | False | False | False | False | False | 1530489600 |
| 4 | 320.0 | F | AV | NE | 320.00 | GHI000441861 | CR | 20-24 | T | 0 | Adult | AV | 5.768321 | 5.768321 | 1.000000 | 2020 | 4 | 18 | 30 | 3 | 121 | True | False | False | False | False | False | 1588204800 |
| 5 | 1000.0 | F | AV | PP | 1000.00 | GHI000153867 | CR | 20-24 | AM | 0 | Adult | AV | 6.907755 | 6.907755 | 1.000000 | 2018 | 10 | 42 | 19 | 4 | 292 | False | False | False | False | False | False | 1539907200 |
| 6 | 310.0 | M | AV | NE | 310.00 | GHI000079891 | CR | 20-24 | BP | 0 | Adult | AV | 5.736572 | 5.736572 | 1.000000 | 2017 | 3 | 12 | 20 | 0 | 79 | False | False | False | False | False | False | 1489968000 |
| 7 | 640.0 | F | AV | M | 640.00 | GHI001674496 | CR | 20-24 | O | 0 | Adult | AV | 6.461468 | 6.461468 | 1.000000 | 2019 | 2 | 6 | 4 | 0 | 35 | False | False | False | False | False | False | 1549238400 |
| 8 | 349.0 | M | AV | PP | 349.03 | GHI000818689 | CR | 60-64 | AM | 0 | Old | AV | 5.855072 | 5.855158 | 1.000086 | 2018 | 10 | 40 | 2 | 1 | 275 | False | False | False | False | False | False | 1538438400 |
| 9 | 320.0 | F | AV | NE | 320.00 | GHI001867266 | CR | 30-34 | BP | 0 | MidAge | AV | 5.768321 | 5.768321 | 1.000000 | 2020 | 1 | 3 | 15 | 2 | 15 | False | False | False | False | False | False | 1579046400 |
Last rows
| Applied | Gender | Payment_Method | Location | Received | Id | Reason | Age | Area | True_False | AgeGroup | Payment_Type | logapplied | logreceived | Ratio | Year | Month | Week | Day | Dayofweek | Dayofyear | Is_month_end | Is_month_start | Is_quarter_end | Is_quarter_start | Is_year_end | Is_year_start | Elapsed | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 31066 | 300.0 | F | AV | NE | 300.00 | GHI000219698 | CR | 35-39 | O | 0 | MidAge | AV | 5.703782 | 5.703782 | 1.000000 | 2020 | 8 | 32 | 4 | 1 | 217 | False | False | False | False | False | False | 1596499200 |
| 31067 | 1050.0 | F | RP | PP | 1050.00 | GHI000141841 | CR | 40-44 | AM | 0 | MidAge | RPU | 6.956545 | 6.956545 | 1.000000 | 2019 | 11 | 46 | 12 | 1 | 316 | False | False | False | False | False | False | 1573516800 |
| 31068 | 720.0 | F | AV | M | 720.00 | GHI000170760 | CR | 20-24 | AM | 0 | Adult | AV | 6.579251 | 6.579251 | 1.000000 | 2017 | 9 | 36 | 6 | 2 | 249 | False | False | False | False | False | False | 1504656000 |
| 31069 | 360.0 | F | AV | NE | 360.00 | GHI000222795 | CR | 25-29 | C | 0 | Adult | AV | 5.886104 | 5.886104 | 1.000000 | 2017 | 10 | 41 | 12 | 3 | 285 | False | False | False | False | False | False | 1507766400 |
| 31070 | 640.0 | F | AV | NE | 640.00 | GHI000077887 | CR | 25-29 | BP | 0 | Adult | AV | 6.461468 | 6.461468 | 1.000000 | 2017 | 4 | 17 | 27 | 3 | 117 | False | False | False | False | False | False | 1493251200 |
| 31071 | 1520.0 | F | AV | NE | 1520.00 | GHI000140573 | CR | 50-54 | AM | 0 | Old | AV | 7.326466 | 7.326466 | 1.000000 | 2018 | 7 | 31 | 30 | 0 | 211 | False | False | False | False | False | False | 1532908800 |
| 31072 | 721.0 | F | AV | NE | 721.00 | GHI000184125 | CR | 30-34 | C | 0 | MidAge | AV | 6.580639 | 6.580639 | 1.000000 | 2017 | 4 | 14 | 4 | 1 | 94 | False | False | False | False | False | False | 1491264000 |
| 31073 | 640.0 | M | RP | M | 640.00 | GHI000084252 | CR | 50-54 | AM | 0 | Old | RPU | 6.461468 | 6.461468 | 1.000000 | 2019 | 7 | 28 | 9 | 1 | 190 | False | False | False | False | False | False | 1562630400 |
| 31074 | 380.0 | M | AV | M | 380.01 | GHI000083247 | CR | 65+ | O | 0 | Old | AV | 5.940171 | 5.940198 | 1.000026 | 2018 | 1 | 2 | 9 | 1 | 9 | False | False | False | False | False | False | 1515456000 |
| 31075 | 800.0 | M | RP | NE | 800.00 | GHI000088447 | CR | 35-39 | W | 0 | MidAge | RPU | 6.684612 | 6.684612 | 1.000000 | 2018 | 1 | 2 | 11 | 3 | 11 | False | False | False | False | False | False | 1515628800 |
Most frequent
| Applied | Gender | Payment_Method | Location | Received | Id | Reason | Age | Area | True_False | AgeGroup | Payment_Type | logapplied | logreceived | Ratio | Year | Month | Week | Day | Dayofweek | Dayofyear | Is_month_end | Is_month_start | Is_quarter_end | Is_quarter_start | Is_year_end | Is_year_start | Elapsed | count | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 106.0 | M | AV | M | 106.0 | GHI000135471 | CR | 45-49 | O | 0 | MidAge | AV | 4.663439 | 4.663439 | 1.0 | 2017 | 6 | 24 | 16 | 4 | 167 | False | False | False | False | False | False | 1497571200 | 3 |
| 5 | 125.0 | M | AV | M | 125.0 | GHI000137735 | CR | 18-19 | S | 0 | Adult | AV | 4.828314 | 4.828314 | 1.0 | 2020 | 7 | 28 | 10 | 4 | 192 | False | False | False | False | False | False | 1594339200 | 3 |
| 182 | 315.0 | M | AV | M | 315.0 | GHI000250631 | CR | 25-29 | AM | 0 | Adult | AV | 5.752573 | 5.752573 | 1.0 | 2020 | 11 | 46 | 11 | 2 | 316 | False | False | False | False | False | False | 1605052800 | 3 |
| 214 | 350.0 | F | AV | PP | 350.0 | GHI001579782 | CR | 30-34 | AM | 0 | MidAge | AV | 5.857933 | 5.857933 | 1.0 | 2019 | 6 | 23 | 7 | 4 | 158 | False | False | False | False | False | False | 1559865600 | 3 |
| 229 | 360.0 | F | AV | M | 360.0 | GHI001297159 | CR | 40-44 | W | 0 | MidAge | AV | 5.886104 | 5.886104 | 1.0 | 2017 | 11 | 44 | 1 | 2 | 305 | False | True | False | False | False | False | 1509494400 | 3 |
| 585 | 900.0 | M | AV | NE | 900.0 | GHI000084252 | CR | 20-24 | AM | 0 | Adult | AV | 6.802395 | 6.802395 | 1.0 | 2018 | 1 | 1 | 3 | 2 | 3 | False | False | False | False | False | False | 1514937600 | 3 |
| 0 | 88.0 | F | AV | M | 88.0 | GHI001755649 | CR | 30-34 | AM | 0 | MidAge | AV | 4.477337 | 4.477337 | 1.0 | 2020 | 6 | 24 | 8 | 0 | 160 | False | False | False | False | False | False | 1591574400 | 2 |
| 2 | 108.0 | M | AV | M | 108.0 | GHI000135471 | CR | 30-34 | O | 0 | MidAge | AV | 4.682131 | 4.682131 | 1.0 | 2019 | 12 | 51 | 20 | 4 | 354 | False | False | False | False | False | False | 1576800000 | 2 |
| 3 | 108.0 | M | AV | M | 108.0 | GHI000135471 | CR | 40-44 | O | 0 | MidAge | AV | 4.682131 | 4.682131 | 1.0 | 2019 | 9 | 37 | 12 | 3 | 255 | False | False | False | False | False | False | 1568246400 | 2 |
| 4 | 125.0 | F | AV | M | 125.0 | GHI000620301 | CR | 45-49 | BP | 0 | MidAge | AV | 4.828314 | 4.828314 | 1.0 | 2018 | 7 | 28 | 12 | 3 | 193 | False | False | False | False | False | False | 1531353600 | 2 |